On the use of GSV-SVM for Speaker Diarization and Tracking

Authors

Viet Bac Le

Claude Barras

Marc Ferras

Abstract

In this paper, we present the use of Gaussian Supervectors with Support Vector Machine classifiers (GSV-SVM) in an acoustic speaker diarization system and a speaker tracking system, and compare them with a standard Gaussian Mixture Model system based on adapted Universal Background Models (GMM-UBM). The GSV-SVM systems, which share the adaptation step with the GMM-UBM systems, achieve comparable performance: for acoustic speaker diarization, the GMM-UBM system outperforms the GSV-SVM system on ESTER2 data, whereas the GSV-SVM system works better for speaker tracking. In particular, a linear combination of the two systems at the score level outperforms each individual system.
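The score-level linear combination mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the fusion weights or any score normalization, so the function name `fuse_scores`, the single weight `alpha`, and its default value are assumptions made purely for illustration; in practice the two systems' scores usually live on different scales and would likely need normalization before being combined.

```python
import numpy as np


def fuse_scores(gmm_ubm_scores, gsv_svm_scores, alpha=0.5):
    """Linearly combine per-trial scores from two systems.

    alpha is a hypothetical fusion weight (not taken from the paper):
    alpha = 1.0 keeps only the GMM-UBM scores, alpha = 0.0 keeps only
    the GSV-SVM scores.
    """
    a = np.asarray(gmm_ubm_scores, dtype=float)
    b = np.asarray(gsv_svm_scores, dtype=float)
    if a.shape != b.shape:
        raise ValueError("both systems must score the same set of trials")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    # Weighted sum of the two score vectors, one fused score per trial.
    return alpha * a + (1.0 - alpha) * b
```

A weight such as `alpha` would normally be tuned on held-out development data rather than fixed by hand.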

Similar resources

Thinkit Audio Genre Classification System for Mirex08

This full abstract describes our submitted system for the MIREX08 Audio Genre Classification task, the goal of which is to discriminate music excerpts of different genres/styles. The system is based on basic feature of MFCC and modeling framework of GSV-SVM, which has been successfully applied in speaker recognition field. In this submission, the only basic feature we use is MFCC. And the goal ...

Experiments on speaker tracking and segmentation in radio broadcast news

In this paper we describe the speaker tracking and clustering system that we implemented for the ESTER evaluation campaign. We present some experiments on normalization in speaker tracking, in particular concerning the use of t-norm for speaker tracking in broadcast news. Results show that the use of t-norm significantly improves the performance at low false alarm rates. In a second part of the...

Modeling Overlapping Speech using Vector Taylor Series

Current speaker diarization systems typically fail to successfully assign multiple speakers speaking simultaneously. According to previous studies, overlapping errors account for a large proportion of the total errors in multi-party speech diarization. In this work, we propose a new approach using Vector Taylor Series (VTS) to obtain overlapping speech models assuming individual speaker models ...

Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study

A system for speaker tracking in broadcast-news audio data is presented and the impacts of the main components of the system to the overall speaker-tracking performance are evaluated. The process of speaker tracking in continuous audio streams involves several processing tasks and is therefore treated as a multistage process. The main building blocks of such system include the components for au...

Variability compensated support vector machines applied to speaker verification

Speaker verification using SVMs has proven successful, specifically using the GSV Kernel [1] with nuisance attribute projection (NAP) [2]. Also, the recent popularity and success of joint factor analysis [3] has led to promising attempts to use speaker factors directly as SVM features [4]. NAP projection and the use of speaker factors with SVMs are methods of handling variability in SVM speaker...

Publication date: 2010
